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Abstract: Severe acute respiratory syndrome (SARS) first appeared in 2002 in China, which fastly affected about 8000 
patients over 29 countries and caused 774 fatalities. As its pathogen was identified as a new kind of coronavirus 
(SARS_CoV), its genome was quickly sequenced on several isolates. Studies on its functional genomics were performed 
by combinatorial application of all the available bioinformatics tools and the development of new programs. In this way, 
it was found that the four proteins were absolutely responsible for nosogenesis of SARS, i.e. spike (S) protein; small en¬ 
velop (E) protein; membrane (M) protein; and nucleocaspid (N) protein. Molecular evolution studies have revealed that 
SARS must be originated from wild animals, and it was demonstrated that the major genetic variations in some critical 
genes, particularly the Spike gene, was essential for the transition from animal-to-human transmission to human-to- 
human transmission. Theoretical models, either Logistic model or SIR model, were developed to describe the transmis¬ 
sion of SARS. The recorded difference of SARS spreading in Beijing and Hong Kong was also reasonably analyzed ac¬ 
cording to these models. The whole process of fruitful bioinformatics studies, along with other related scientific investi¬ 
gations have set up an unprecedented paradigm for human of how to battle against sudden-breaking and catastrophic epi¬ 
demics. 
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INTRODUCTION 

Severe acute respiratory syndrome (SARS) first emerged 
in November, 2002, in Guangdong Province, Southern 
China, and then was identified as a new coronavirus in 
March, 2003 LI, 2]. Later it was known that a new pathogen, 
a member of the Coronaviridae family of enveloped, POSI¬ 
TIVE-STRANDED RNA VIRUS, the SARS_CoV was 
origin of epidemic. Less than half year later, it had affected 
over 8000 patients in 29 countries with 774 fatalities. It was 
an unprecedented global experience in the rapidity and ex¬ 
tent of its spreading, the magnitude of its impact on the 
health systems and economies, and in the effectiveness of its 
control [3]. 

After SARS broke out, Chinese government took aggres¬ 
sive public health measures to bring SARS under control, 
but at that time there were no effective drugs or vaccines 
against SARS, not even drastic debating on the pathogen 
identification. Control of this disease only relied on the rapid 
identification of cases and their appropriate management, 
including the isolation of suspect and probable cases and the 
management of their close contacts. By these very strict 
isolations it was impossible to get enough samples for labo¬ 
ratory experiments at the early stage of epidemic. On the 
other hand, there were no safe experimental equipments 
suitable for such virology study. Most of the life science 
research laboratories in China could not perform their initial 
studies about SARS_CoV, therefore, scientists in China had 
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no opportunity to be the first to nail the pathogen, sequence 
its genomes, or describe how it sickens its victims based on 
the laboratory works. Although at the first stage, Chinese 
scientists were “defeated” [4], the researchers determined to 
catch-up and many institutes are now abuzz with SARS 
research projects after Chinese government had embraced 
science as a key weapon against the disease. Researches in 
various areas had been processed, and bioinformatics was 
used as one of the practical tools. Under such crucial situa¬ 
tion, “Bioinformatics” studies were started, and now its 
important role is generally recognized. These studies have 
given a strong support to the campaigns against SARS in 
China, and benefited worldwide battle against this disease. 
Several hundreds of bioinformatics based or related scien¬ 
tific papers were published around the time when SARS 
emerged, and a many wide research fields were involved, 
including: the molecular biology of viruses, the pathogenic 
mutation, nosogenesis, the channels for the spread of the 
pathogeny, epitope prediction and immunity mechanisms, 
drug design, vaccine design and vaccination development, 
early diagnosis and treatment of the SARS disease, and its 
prevention. Generally, these studies can be grouped into 
three aspects of SARS_CoV research: functional genomics, 
which was focused on gene prediction and functional 
mechanisms of gene products; molecular evolution, which 
was mainly about genotype identify, evolution distance; and 
the transmission model, which could be used to predict the 
SARS’ spread capability and help to give the evaluation to 
the effect of controls, related adjustment. In the following 
sections, we will try to summarize the latest advances of 
these aspects; also some of our own work is included. 
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FUNCTIONAL GENOMICS STUDIES ON SARS_COV 

Just after the whole genome of SARS_CoV was se¬ 
quenced by scientists in Canada on 13 April, 2003[5], the 
complete genome sequence of another isolate of SARS, 
BJ01, was sequenced by Chinese scientists at BG1 (Beijing 
Genomics Institute, CAS) [6], and 11 ORFs (open-reading 
frame) were identified through ORF Finder (http://www. 
ncbi.nlm.nih.gov/projects/gorf/). The physical and chemical 
features of proteins were predicted by using Compute 
Pl/MW (http://us.expasy.org/tools/pi_tool.html) and also the 
N-Glycosylation sites were predicted by applying NetNGlyc 
1.0 (http://www.cbs.dtu.dk/services/NetNGlyc/). Sooner or 
later. Qin et al. completely sequenced the genome of the 
GD01 isolate [7], and the isolates of BJ group (Isolates 
BJ01, BJ02, BJ03 and BJ04) were sequenced by BGI (8]. 
Additional two ORFs were identified later. All these re¬ 
searches carried multi-sequence alignment with the other 
isolates of SARS_CoV, and the mutations among different 
virus genomes were found. Furthermore, the multi-sequence 
alignment with other coronavirus genomes confirmed the 
early hypothesis that the SARS_CoV may be originated 
from animals [7, 8]. 

By applying several prediction methods, some groups 
put more efforts to predict new ORFs in order to complete 
SARS_CoV genome annotation. Through the comparison 
and evaluation of 12 ORFs by different prediction methods, 
Chen et al. selected 4 of them [9]: Heuristic models, gene 
identification and ORF finder, doing gene prediction of 
SARS_CoV genome. The ATGpr program, which is the start 
coden prediction software based on a linear discriminant 
algorithm, is usually used to calculate and estimate the prob¬ 
ability of each ATG being the initiation codon [10]. Chen et 
al. used this program to verify possibilities of start codon 
and to see whether the Kozak rule [11] was followed. Tak¬ 
ing advantage of this combinatory gene finding strategy, 21 
new proteins were predicted and the appearance possibilities 
were analyzed by BLAST and FASTA with other sequences 
in nr database, and all these informations were extracted for 
further evaluations [12]. 

Developing new algorithms about mining biological data 
and knowledge extraction is always the most active work of 
bioinformatics. In searching of novel genes from SARS_ 
CoV genome with higher accuracy and sensitivity, more 
efforts were also put into developing of new gene prediction 
algorithms (as shown in Fig. 1). Zhang developed a new 
program based on the famous Z curve theory of DNA se¬ 
quence LI3], and named it as ZCURVE_CoV (version 1.0) 
[14]. This program can be used to recognize protein coding 
genes in coronavirus genomes with highly accurate gene 
start prediction, and particularly suitable for the prediction 
of SARS_CoV genomes. Comparing with some other avail¬ 
able tools, this new program package has the merits of sim¬ 
plicity, higher accuracy, higher reliability, and quickness. 
Zhang’s group used ZCURVE_CoV system for each of the 
11 newly sequenced SARS_CoV genomes, and got good 
results. For example, annotating six genomes which were 
not annotated previously, checking and discussing of some 
problems on previous annotations of other five genomes. 
Besides the polyprotein chain ORFs la and lb and the four 
genes coding for the major structural proteins (i.e. spike (S) 


protein, small envelop (E) protein, membrane (M) protein 
and nuleocaspid (N) protein). ZCURVE_CoV also predicted 
5-6 putative proteins in length between 39 and 274 amino 
acids with unknown functions. Some single nucleotide mu¬ 
tations within these putative coding sequences have been 
detected and their biological implications were discussed. 
The improved version 2.0 of this program was applied to 
identify all the non-structural proteins cleaved by viral pro- 
teinases in the polyproteins [15]. The ZCURVE_CoV pro¬ 
gram can be accessed through its web server (http://tubic.tju. 
edu.cn/sars) and user can obtain the annotated results very 
quickly by pasting sequences. This program can also be 
freely downloaded from its website and run locally. 

In order to clarify the mechanism of the nosogenesis of 
SARS disease, more and more researches focused on the 
annotation of protein functions. HMMER and BLIMPS were 
employed to identify the conserved domains in PFAM [16] 
(http://www.sanger.ac.uk/Software/Pfam/) and BLOCKS 
[17] (http://blocks.fhcrc.org), respectively. The SI and S2 
domains of the Spike protein were correctly predicted [9]. 
Other domains were analyzed and their functions were pre¬ 
dicted by synthesizing different alignment and prediction 
methods. PS1PRED [18] was employed to predict corre¬ 
sponding protein secondary structures [12]. The transmem¬ 
brane segment and the transmembrane helices of the protein 
M were predicted by using TMAP [19] and TMHMM2.0 
[20]. The sub-cellular localization of proteins was predicted 
by PSORT [21]. Other programs for analyzing protein fea¬ 
tures including SAPS [22], PI and EXTCOEF [23] were 
widely used. These studies together with the achievements 
by researchers all over the world demonstrate that four pro¬ 
teins and the main protease (3C-like proteinase) of R protein 
[24] are the most important proteins associated with 
SARS_CoV infection, these are the S protein, the E protein, 
the M protein and the N protein. Based on the complete 
genome sequences for four isolates (BJ01-BJ04) of SARS_ 
CoV from Beijing, China, Yang et al. analyzed the structure 
and predicted functions of the R protein by comparing with 
13 other isolates of SARS_CoV and other coronavirus [24]. 
Rota and Marra et al. [5, 25] obtained very similar results by 
annotation. The entire ORF encodes for two major enzyme 
activities, RNA-dependent RNA polymerase (RdRp) and 
proteinase activities, namely the main proteinase 3CLP. 
They found the entire 15 function-related peptides, which 
were deemed from a complex proteolytic process of R poly¬ 
protein. Of 15 peptides, a hydrophobic domain (HOD) and a 
hydrophilic domain (HID) were newly identified within 
NSP1 of ORFlab of SARS_CoV. Eleven highly conserved 
regions in RdRp and twelve cleavage sites by chymotrypsin- 
like protein (3CLP) had been identified as potential drug 
binding sites. 

Since antisense RNA and RNA interference (RNAi) 
technologies have shown potential prospect in treating some 
severe diseases, it was considered as important candidate 
medicines against SARS. To help the design of such drugs, 
the prediction of probable genomic packaging signal of 
SARS_CoV was carried out on the comparison of genomic 
packaging signals of MHV and BCoV with both primary 
and secondary structure [26]. The primary sequence multi¬ 
alignment was carried out with Vector NTI [27] and secon¬ 
dary structure of genomic RNA sequences were predicted by 
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Fig. (1). (A) General pipeline for the annotation of SARS genome, 
nome. 

using the RNA Structure [28]. It was suggested that probable 
genomic packaging signal of SARS_CoV was analogous to 
that of MHV and BCoV, with the corresponding secondary 
RNA structure locating at the similar region of ORFlb, but 
the positions for genomic packaging signals must have suf¬ 
fered rounds of mutations, and inversely these mutations 
may influence the primary structures of the N and M pro¬ 
teins. 


. (B) A detailed operating process for the gene annotation of SARS ge- 


Instead of sequence based functional annotation, a num¬ 
ber of research groups focused on the aspects of immu¬ 
nological mechanism of SARS_CoV infection. Among mul¬ 
tiple steps of this approach, cell epitopes prediction is the 
first and the key for the preparation of antibodies of SARS_ 
CoV. The combination of Kyte-Doolittle [29], Ernini [30] 
and Jameson-Wolf [31] methods to predict the B-Cell epi¬ 
topes is the one most frequently used. Goldkey developed by 
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Chinese Academy of Military Medical Sciences was em¬ 
ployed to predict the B-Cell epitopes in the S and M proteins 
of SARS_CoV [32]. Another group predicted the epitopes of 
M and E proteins of isolates tor2 by Protean [33]. Combin¬ 
ing at least three parameters, Lv et al. also considered the 
secondary structure of the M protein to get a more reason¬ 
able result [34]. The epitopes in M protein were predicted by 
these three groups are similar to one another. Of all the 9 
epitopes predicted, the pairwise rate of agreement is up to 
88.9%. A combination method with artificial neural network 
and quantitative matrix was developed and predicted six 
HLA-A* CTL epitope candidates in SARS_CoV N protein 
[35]. Integrating for analyzing the CD 13 binding sites in 
SARS_CoV spike protein [36]. Yu et al. used a series of 
methods including comparative genomics, homology search, 
phylogenetic analyses, and multi-sequence alignment for 
sequence similarity comparison and structure comparison. 
They identified several domains and motifs responsible for 
CD 13 binding at the possible binding site in the S protein of 
SARS_CoV [36]. This information provided indicative clues 
for the function study of SARS proteins and the design of 
anti-SARS drugs and vaccines. Further molecular modeling 
and molecular docking addressing the interacting features 
between CD 13 and S protein of SARS_CoV validated the 
bioinformatics predictions. They found that the P585-A653 
domain of CD13 was pairing with the D757-R761 motif of 
the SARS_CoV S protein to form the complex, so the CD 13 
may be a possible receptor of the SARS_CoV S protein, and 
its binding may result in the SARS infection. This study also 
provided a practically useful strategy for mapping the possi¬ 
ble binding receptors of the proteins in a genome. The com¬ 
plete sequence determination of the human genome marks 
the start of a new era in biological sciences, with a focus 
shifting from sequencing to functional mechanisms of gene 
products. Afterwards, a number of works were focused on 
the genome based drug discovery. This change was also 
introduced to the anti-SARS research. Just after the whole 
genome sequence of SARS_CoV was reported [5, 6], some 
research groups tried to follow this change for the finding of 
candidate drug targets and for the design of potential drugs. 
Luo et al. [37] studied the interactions between the N protein 
of SARS_CoV (SARS_NP) and human cyclophilin A 
(hCypA) based on the work of Yu, et al. [38]. Nucleocapsid 
protein (NP) of SARS_CoV (SARS_NP) functions in envel¬ 
oping the entire genomic RNA and interacts with viron 
structural proteins, thus playing important roles in the proc¬ 
ess of virus particle assembly and release. Protein-protein 
interaction analysis using bioinformatics tools indicated that 
SARS_NP may bind to human cyclophilin A (hCypA), and 
measurements by surface plasmon resonance (SPR) technol¬ 
ogy found that the equilibrium dissociation constant of this 
binding ranged from 6 to 160nM [37]. The probable binding 
sites of these two proteins were investigated by modeling the 
three-dimensional structure of the SARS_NP-hCypA com¬ 
plex, from which revealed the important interaction residue 
pairs between the proteins. Site-directed mutagenesis ex¬ 
periments were carried out for validating the binding model, 
whose correctness was assessed by the observed effects on 
the binding affinities between the proteins. The derived 
binding site is reliable as there is a good agreement between 
the SPR data and the computationally predicted mutual bind¬ 
ing free energies for the binding of SARS-NP (or hCypA) 


mutants with the wild-type hCypA (or SARS-NP). This new 
derived SARS-NP—hCypA interaction model might hint 
another possible SARS_CoV infection pathway against 
human cell. The whole study has set a good paradigm of 
how to find a possible candidate drug target stalling from the 
whole genome information to the wet experimental test. 

To study the role of individual proteins, 3D structural 
modeling is a practical way, for example, the S protein of 
SARS_CoV [31]. The 3D structure of SARS_S1B was pre¬ 
dicted by Homology modeling based on the structure of 
template proteins(PDB entries 1AOF and 1NIR) [39]. The 
structural model was assessed by PROCHECK [32] for 
stereochemical quality, by Profile-3D [39] for structure- 
sequence compatibility, and by WHATIF program [40] for 
the rationality of the predicted protein-protein interactions 
between hAPN receptor and the S protein of SARS_CoV 
[36]. The experimentally measured data of SI-ACE-2 bind¬ 
ing [39] was reasonably explained according to the con¬ 
structed model. 

MOLECULAR EVOLUTION OF SARS_COV 

In order to escape from host defense, RNA viruses, such 
as coronavirus, commonly have a high rate of genetic muta¬ 
tion, and therefore evolve into novel viral strains. Similarly, 
the mutation rate of the SARS_CoV is a key factor to know 
how it spreads through the population. On the other hand, if 
it is possible to find a clear date for the last common ances¬ 
tor of SARS coronavirus strains, it should be useful to un¬ 
derstand the circumstances surrounding the emergence of the 
SARS pandemic and the rate at which SARS_CoV diverges. 
Based on this purpose, quite a few research groups per¬ 
formed their molecular evolution researches and tried to 
answer two questions: the origins and the spread of SARS_ 
CoV. 

Zhao et al. sequenced a large number of samples derived 
from patients in different time, different areas, and paid 
much more attention in search of the genotype characteris¬ 
tics of early, middle and late phase [41] (Fig. 2). In their 
work, sequences that were derived directly from the pa¬ 
tients’ clinical specimens were used for statistical analysis. 
Sequences generated from specimens collected more than 4 
weeks after disease onset were excluded. Among all of the 
available sequences, only 10 (GZ02, CUHK-AG01, CUHK- 
AG02, GZ-C, GZ-D, HZS2-A, HZS2-Fb, HSZ-A, HSZ-Bb, 
HSZ-Cb) of them met all the criteria. For the phylogenetic 
analysis, GZ02 was selected as the out-group, since it was 
the most divergent from all of the remaining 9 sequences. 
The Pamilo-Bianchi-Li model [42, 43] was used to calculate 
the Ks (the rates of synonymous changes) for the 6 known 
concatenated coding sequences (orfla. orflb, S, E, M, and 
N) of the SARS_CoV genome. In order to find the differ¬ 
ence of evolutionary rates among different stages of the 
epidemic, all of samples were divided into four types: 
HP03E (human patient in early stage, 2002-2003), PC03 
(palm civet, 2002-2003), PC04 (palm civet, 2003-2004) and 
HP04 (human patient, 2003-2004) for the estimation of the 
neutral mutation rate and the date. The MRCA illustrated 
that the SARS_CoV evolved in both palm civet and human 
[44]. It is clearly indicated that the PC03 and PC04 were not 
in the same primary transmission lineage. This further dem- 
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onstrated that SARS was from unknown origin and evolved 
not only in humans but also in palm civet hosts [44]. 
Through exhaustive calculations, they finally detected that 
the neutral mutation rate of the viral genome was constant, 
but the amino acid substitution rate of the coding sequences 
was slowed during the course of the epidemic. They also 
compared SARS_CoV from patients in different stages with 
animal SARS-like coronaviruses from palm civets, and 
found that the earliest genotypes for human SARS_CoV 
were very similar to the animal SARS-like coronaviruses. 
Major genetic variations in some critical genes, particularly 
the Spike gene, seemed essential for the transition from 
animal-to-human transmission to human-to-human transmis¬ 
sion, which eventually caused the first severe acute respira¬ 
tory syndrome outbreak of 2002-2003 [41], 

As mentioned above, the genotypes of SARS_CoV 
originated from human and palm civet were much alike. 
This indicates that the origin of SARS was possibly from 
wild animal. To clarify this mystery, high-throughput ge¬ 
nome sequencing based on a large number of samples was 
performed, and a huge amount of sequence data was gener¬ 
ated [41]. By the calculations using bioinformatics tools, a 


very interesting phenomenon was detected. At the early 
stage of epidemic, only two main genotypes existed; in the 
middle stage of epidemic, more and more different geno¬ 
types emerged because of selection pressure; but in the late 
stage of epidemic, a dominant genotype was formed and 
kept at a very low evolutionary rate. This shows that the 
evolutionary ability can restrict lethal mutations of SARS_ 
CoV and can also reduce mutated gene number related with 
forming new phenotypes. 

In view of methodology, this work was the first achieve¬ 
ment of combining genome sequencing, molecular evolu¬ 
tion, computational biology and bioinformatics in the study 
of virology and epidemic. This story also shows that 
multidisciplinary cooperation is needed for the research on 
the battle against SARS_CoV. 

Many other computational biology tools have been used 
to explore the source of SARS_CoV. Researchers used 
FDOD, PHYLIP for calculating the distance based on the 
whole genome sequence, and predicted that SARS_CoV 
shared the origin with Porcine epidemic diarrhea virus [45]. 
Some other researchers used PSI_BLAST for exploring the 
origin of SARS, and concluded that SARS_CoV should be 



Fig. (2). Genotype clustering of SARS-CoV covering the epidemics from 2002 to 2004 illustrated by (A) an unrooted phylogenetic tree con¬ 
structed with complete SNVs and deletions of 91 sequences from the human patient-derived viruses (HP) and 5 sequences from the palm 
civet-derived viruses (PC) and (B) a neighbor-joining (N-J) tree for the consensus nucleotide sequences of PC and early individual transmis¬ 
sion lineages of HP. (Adopted from Song H.D. et al. 2005 [44]). 
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closer to Bovine coronavirus [12]. In addition, Lu et al. also 
proposed a new mathematical model to estimate the evolu¬ 
tion rate of the SARS_CoV genome as well as the time of 
the last common ancestor of the various SARS_CoV strains 
[46]. Based on 6 strains with accurate dates of host death, 
they estimated a time of the last common ancestor and an 
evolutionary rate in the same range as that was reported for 
the HIV-1 virus [46]. 

On the other hand, sequence structure and features have 
also been studied. For example, codon preference were 
studied, and it was found that SARS’s preference was more 
similar to Eukaryota than to Prokaryota, and suggested that 
Eukaryota such as yeast is better than others to be used to 
express SARS genes [47]. Dinucleotide signature of the 
Genome of SARS Coronavirus was analyzed and revealed 
that SARS coronavirus was close to the Group I coronavirus 
(Human coronavirus. Porcine epidemic diarrhea vims, etc.), 
especially transmissible guest roenteritis vims [48]. 

TRANSMISSION MODELS OF SARS 

Many models were put forward to describe the transmis¬ 
sion of SARS, Most of them can be grouped into either defi¬ 
nite model or random model. Logistic model is the simplest 
definite model. Zheng et al. built the following Logistic 
model [49, 50] to describe the SARS: 

— =rN-kN 2 =N(r-kN ) 0) 

dt 


Where N is the number of accumulated infected patients, t is 
the day, k is death rate, which not only depends on the 
SARS vims itself, but also on the effect of prevention and 
control. It has a direct influence on the epidemic, disease- 
lasting time, disease-peak appearance time and accumulated 
patients’ number. R means disease incidence rate, and larger 
r means that the disease-development varies faster, and in¬ 
creasing rapidly from initial period to peak time. The larger 
k means that the patients’ number is smaller at the peak 
time. 

SIR Model is a random model put forward by Chen and 
Huang et al. [51-53]. 

Pr\dN(t) = \,dR(t) = 01 F,} = 0 S(t)f(t)dt + o(dt) (3) 

Pr [dN{t) = 0, dR{t) = 11 F t } = yl(t)dt + o{dt ) (4) 

Pr[diV(t) = 0 ,dR(t) = OIF,} = 1-0 S(t)I(t)dt-yI(t) + o(dt) (5) 

In above equations, N(t) is the accumulated number of 
infected patients at time t, I(t) is the number of infectable 
persons, S(t) is the number of susceptible persons, R(t) is 
cured patients and dead patients, N(t) is total number. 
S(t) = S(t)/n , is the proportion of susceptible number to 
total number. 

Equation (3) describes the probability of an additional 
infector in a very short time period after time t. It shows that 
this probability has positive correlation to I(t) and S(t) ■ 0 is 
the probability of healthy person getting sick after contacting 
with an infectable person, thus it is the transmission rate 


from susceptible to infected, y in equation (4) is the recipro¬ 
cal of average infecting time, which is the possibility from 
infected state to the dead or the cured, thus considered as the 
removal rate. p. (p=0/y) is the basic reproduction rate, and 
can reflect the ability of how many people inducted as pa¬ 
tients can be inducted during the induction period. When p 
is smaller than 1, inducted population will be small and the 
infectious disease will disappear by itself. However, when p 
is larger than 1, the infectious disease will spread continu¬ 
ously and induct more persons. 

Chen et al. [51] compared the SARS data about the 
number of infected and latent people in Beijing with Hong 
Kong. It was found that Hong Kong’s 0 value was smaller 
than Beijing, and the decreasing speed for p value to 1 was 
faster than that of Beijing, 

It should be pointed out that the two models are well 
fitted from the real epidemic kinetics of SARS. The logistic 
model is more powerful for the description of the guard 
effect, while the SIR model is better for the description of 
the SARS’ spread capability without any disturbance. With 
the help of these simple mathematical models, the govern¬ 
ment and the related organizations can forecast the number 
of people that will be infected by SARS and how long SARS 
will continue. What’s more, with the evaluation to the effect 
of controls, related adjustment can be applied. 

CONCLUDING REMARK S 

Three years ago, the initial cases of SARS appeared in 
the Guangdong Province in the South China. Afterwards, 
this terrible epidemic started to spread around the world. It is 
the first wild spreading and easy-infected disease of this new 
century, especially of its unusually high morbidity and mor¬ 
tality rates. It had taken full advantage of the opportunities 
provided by a world of international travel. At that time, 
China had no effective drugs or usable vaccines against 
SARS, so the control of this terrible disease had to rely on 
the rapid identification of susceptible cases and effective 
management of the discovered cases, including the isolation 
of the suspects with their close contacts. These measures 
prevented imported cases from spreading the disease to 
others, but also brought disadvantage for the scientific re¬ 
search at laboratory level. 

As soon as the genome sequence of SARS_CoV was 
released from NCBI, many research groups in China took 
tools in bioinformatics in combination with a variety of 
other practical ways to battle against SARS in trying to first 
find the origin of SARS_CoV, and then studied the function 
of its genes, protein-protein interactions, possible epitopes, 
protein structures and the search of drug target candidates. 
Most of the bioinformatics researches as described above are 
fruitful and meaningful. It is no doubt that “Bioinformatics” 
research works helped China to gain time for further bio¬ 
medical experiments against SARS, and in general it also 
provided a useful scientific paradigm on how bioinformatics 
research can work together with other areas in the face of 
sudden-breaking and catastrophic epidemics. 

Recently, the specific science meeting held in Washing¬ 
ton D. C of USA heard that the strain of SARS_CoV, which 
was jumping readily between humans, may initially exist in 
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laboratory samples. This would be an unhappy accident. 
There may be a fresh mutation of the virus in animal host for 
this re-emerging. And even if it did, it could be quickly 
contained (http://news.bbc.co.Uk/l/hi/health/4280253.stm). 
As a whole, all the researches on SARS_CoV are certainly 
helpful in the future for the worldwide battle against other 
spreading human diseases caused by unknown coronavi- 
ruses. 
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